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ABSTRACT 

ZOBOV (ZOnes Bordering On Voidness) is an algorithm that finds density depressions in a set 
of points, without any free parameters, or assumptions about shape. It uses the Voronoi tes- 
sellation to estimate densities, which it uses to find both voids and subvoids. It also measures 
probabilities that each void or subvoid arises from Poisson fluctuations. This paper describes 
the ZOBOV algorithm, and the results from its application to the dark-matter particles in a 
region of the Millennium Simulation. Additionally, the paper points out an interesting high- 
density peak in the probability distribution of dark-matter particle densities. 
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1 INTRODUCTION 



Voids are an essential com ponent of the cosmic web 
(Bond ^ofman & Pogosvaril 1 19961) of matter in the Universe 
on several-Megaparsec scales. They are fascinating from an 
information-theoretic viewpoint, as a probable component of 
efficient descriptions of large-scale structure in the non-linear 
regime. Voids also provide useful tools for studying cosmology 
and galaxy formation. The way in which matter inside a void 
flows away from its centre holds information about cosmological 
para meters such as the ma tt er density Q m and dark-energy density 
yiA jPekel & Reesl [l994l : iBernardeau & van de Weygaertl 1 19971 : 
iFliche & Triavll2006l). and about t he clustering, if it exists, of dark 
energy jlVlota. Shaw & Silldl2008l) . Also, measuring the evolution 
of void ellipt icities can give co nstraints on the dark-energy equa- 
tion of state dLee & Parkll2007t) . The existence of large voids has 
been invoked to expl ain the 'cold spot' on the Cos mic Microwave 
Background (CMB) teudnick. Brown & Williamsll2007l): anoma - 
lously low large-angle CMB anisotropics dlnoue & Silkl l2006f) : 
and even the a pparent acceler a ting expansion of the U niverse (e.g. 
Moffat 2006; Celeriel [20071 : lAlexander et alj l2007h . Voids are 
also relatively pristine laboratories to study galaxy formation and 
evolution, con taining t h e mos t isolated galaxies in the Universe. 
For example, IPeeblesI fcOOll) has pointed out that there seem 
to be fewer galaxies in voids than cosmological simulations 
predict. Even if this is not a discrepancy with the underlying 
ACDM cosmology, it contains valuable information about galaxy 
formation. 

Despite these useful features of voids, they are not currently 
in the forefront of cosmological probes. One reason for this is 
that there remains no standard definition o f them. The Aspen- 
Amsterdam Void-Finder Comparison Project dColberg et ai]|2008l. 
AAVFCP) makes an important first step in exploring differences 
and similarities in void definitions, but still, a consensus about how 
to define a void does not exist. Here I present ZOBOV (ZOnes Bor- 



dering On Voidness), a void finder whose features, I believe, are 
appealing enough that it represents a net contribution toward that 
consensus, rather than simply adding another alternative to recon- 
cile with the others. 

Many void finders define voids as sphe res, or unions of a fi- 
nite number of spheres or other shapes (e.g . Kauffmann & Fairalll 
1 199 it iMiiller et alj l200d : iHovle & Vogeievl |2002| ; IColberg et al.1 
2005). This definition has some theoretical justification, since un- 
derdense regions expanding in a ho mogeneous background tend 
to become more spherical with time Eel 1 19841) . Also, this def- 
inition is geometrically simple. However, the real Universe con- 
sists of many underdense regions that collide with each other 
and produc e voids that are often more polyhedral than spherical 
(e.g. Icke & van de Weygaertl 19871) . or even more generally shaped 
( Shan darin et alj|2006h . ZOBOV imposes no prejudice about the 
shape, or even topology, of a void. Some other void finders (e.g. 



El- Ad & PiraJl997l;lAikio & Mahonenl l 19981 : [Plionis & Basilakosl 
2002 1: IShandarin et al.ll2006l:lHahn et al.l2007l:lArag6n-Calvo et al.1 
20071 : IPlaten. van de Weygaert & Jonesl 120071) also define voids 



with no, or only weak, rules about their shapes. 

ZOBOV aims to find voids from a set of points with as few 
restrictions as possible. Conceptually, a ZOBOV void is simply a 
density minimum with a depression around it. ZOBOV has no free 
parameters. However, the ambiguity in void finding, when applied 
to noisy data, must be placed somewhere. With no free parame- 
ters to tune, ZOBOV returns many (indeed, mostly) shallow, hardly 
visible voids. However, ZOBOV measures a statistical significance 
for each void. A physical significance criterion can also be used, 
requiring that a void's minimum density p m m/p < 0.2. This is 
a characteristic density of a void in an Einstein-de Sitter model 
(with only a slightly different density in ACDM), obtained us- 
ing a top-hat spherical expansi on model dBlumenthal et alJI 19921 : 
ISheth & van de Wevgaert 2004). For the abstract problem of find- 
ing voids in a particle distribution, this strategy of returning all 
possible voids, even statistically dubious ones (as long as they are 
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marked as such), seems more satisfying than making an arbitrary 
choice of free parameters. For many applications, though, actual 
use of a ZOBOV void catalogue might require another arbitrary 
choice, about the level of significance at which to accept a void. 
Another philosophical difference between ZOBOV and most void 
finders is that ZOBOV returns subvoids along with voids. 

ZOBOV is an inversion of an 'almost-parameter-free' dark- 
matter-halo finder, called VOBOZ . (VOronoi Bound Zones; 
iNevrinck. Gnedin~& Ham iltor]|200l NGH). The major change in 
ZOBOV is that it looks for density minima instead of maxima. In 
fact, the VOBOZ-ZOBOV algorithm is perhaps better-suited for void 
finding than halo finding. This is because the algorithm typically 
detects highly nonspherical shapes; to get roughly spherical, virial- 
ized haloes, VOBOZ trims their edges with a boundedness criterion 
using particle velocities. It is in this step that VOBOZ loses its pure 
parameter-freedom. 

Perhaps the existing void finder most similar to ZOBOV is 
WVF (the Watershed Void Finder; IPlaten et al] l2007h . Both use 
tessellation techniques to measure densities, and both use the 
'watershed' concept, defining voids with analogy to catchment 
basins in a density field. However, WVF uses several clever tech- 
niques from the field of mathematical morphology to smooth 
the particle density before defining voids, while ZOBOV anal- 
yses the raw, unsmoothed data. There are other methods that 
use tessellation techni ques to find clu sters dRamella et alj 200ll; 
Barkhouse et al. 2006; Sochting et al. 2006; van Breukelen et al. 
20061; iMelnvk, Elviv & Vaviloval |2006|) or voids dGaitel 120051 ; 



Aragon-C alvo et alj|2007l) . 



First I will discuss the ZOBOV algorithm, and then I will dis- 
cuss its application to dark-matter partic les in a region of the Mil- 
lennium simulation tSpringel et al. 2005). Some of these results ap- 
pear in AAVFCP, where they are also compared to the results of 
other void finders. Finally, I will discuss what I feel are the unique 
strengths and weaknesses of ZOBOV, and what could be done to 
improve it. 



2 METHOD 

The ZOBOV algorithm is the same as the first two steps of the 
VOBOZ (NGH) algorithm, except that it searches for density min- 
ima instead of maxima. 



2.1 Particle density and adjacency measurement 

The first step is t he dens it y esti mation at each dark-matter par- 
ticle, using what ISchaad hoot calls the Voronoi Tessellation 
Field Estimator (VTFE). Tessellation met hods for den s ity estima- 
tion are widely used in many fields (e.g. lBrownlll965l ; IOrj[l97^ : 
iBiilow-Olsen. Sackville Hamilton & Hutchingall983h . A goo d ref- 
erence on this topic is provide d by lOkabe et alj d2000h : see 
van de Wevgaert & SchaarJ J20081) for a review specific to large- 
scale structure. The VTFE (along with its dual, the DTFE) gives ar- 
guably the most local possible density estimate that has meaning- 
ful information. The Voronoi tessellation divides space into cells 
around each particle, with the cell around particle i defined as the 
region of space closer to particle i than to any other particle. The 
density estimate at particle i is 1/V(i), where V(i) is the volume 
of the Voronoi cell around particle i. The Voronoi tessellation also 
gives a natural set of neighbours for each particle (the set of parti- 
cles whose cells neighbour i's cell), which ZOBOV uses in the next 
step. 



Figure QJ shows a set of particles in 2D, corresponding to 
galaxies in a slice of the Millennium simulation. Figure[Tp depicts 
the Voronoi tessellation of this set of particles, with the Voronoi 
cells shaded according to area. 

2.2 Zoning 

The second step in ZOBOV is the partition of the set of particles 
into zones around each density minimum. This is done partly for 
computational speed, and partly to compress the information in the 
dataset. A minimum is a particle with lower density than any of 
its Voronoi neighbours. ZOBOV sends each particle to its lowest- 
density neighbour, repeating the process until it arrives at a mini- 
mum. A minimum's zone is the set of particles which flow down- 
ward into it, and a zone's core is the minimum-density particle of 
the zone. Figure QJ shows how ZOBOV partitions the particles in 
the previous panels into zones. These zones could conceivably be 
called voids. Because of discreteness noise, though, many zones are 
spurious, and others are only the central parts of what are picked out 
as voids by eye. Thus, it is necessary to join some zones together 
to form the final voids. 

2.3 From zones to voids 

Zones are joined as follows. Imagine a 2D density field (repre- 
sented as height) in a water tank; see, for example, Fig. 1 of 
IPlaten et al] (2007). For each zone z, the water level is set to z's 
minimum density, and then raised gradually. Water may flow, along 
lines joining Voronoi neighbours, into adjacent zones, adding them 
to the void defined around the zone z. The process stops when wa- 
ter flows into a deeper zone (with a lower minimum than z's), or if 
z is the deepest void, when water floods the whole field. The final 
void corresponding to z is defined as the set of zones containing 
water just before this happens. 

The minimum-density (core) particle of the original zone is 
also the minimum-density particle of the zone's void. Many low- 
significance zones fail to annex surrounding zones as they attempt 
to grow; a zone in this situation has a void equal to itself. The 
density (water level) at which water flows into a deeper zone is 
recorded as pi (z) (I stands for 'link' to a deeper zone). 

Figure [T]l shows the stages of growth that the deepest void 
in the set of particles undergoes. Successively lighter colours shade 
zones added when the density level reaches successively higher lev- 
els. Since this is the deepest void, its last extent encompasses the 
whole simulation, except for the zone with the highest-density par- 
ticle separating it from other zones, in the lower-right corner of the 
figure. This does not mean that other voids are not detected; they are 
subvoids of this largest void. Still, to form voids conforming better 
to intuition, a further criterion could be used to halt the growth of 
voids containing several zones. I will come back to this issue after 
discussing the statistical significance of voids, which will be useful 
for defining their edges. 

This way of defining voids can lead to surprising void topolo- 
gies and shapes. For example, if a set of particles consists of a 
clump surrounded by a low, uniform-density background, every- 
thing but the clump will be detected as a void. Also, even a single 
low-density particle along a wall between two visually apparent 
voids might cause ZOBOV not to detect them separately, but instead 
to detect a single, dumbbell-shaped void. However, many (about 
16) particles directly participate in each particle's density estimate. 
Thus, such a hole in the wall between voids would have to be a con- 
spiracy of many particles, and would likely look like a significant 
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Figure 1. (a) Galaxies ICroton et"al] |2005, down to B = —10) from a 40 X 40 X 5 Mpc) 3 slice of the AAVFCP region. The outer boundary is 

45 h _1 Mpc-square. The slice is the same size as the dark-matter illustration in Fig. [6] but is at an edge of the central 40-h~ 1 Mpc cube, not at the centre. 
It was chosen because the voids in this figure are less well-defined, and thus richer in structure, (b) The 2D Voronoi tessellation of galaxies in this slice, 
with each particle's Voronoi cell shaded according to its area. The galaxies outside the inner (40 h~ 1 Mpc) boundary are shown because they contribute 
to the tessellation, (c) Zones of galaxies. The cores (density minima) of each zone are shown with crosses, the different colours merely demarcate different 
zones, (d) The growth of void 1, the deepest void in the sample. With analogy to a water tank, the water level (density) is increased, and zones the water 
runs into are added to the void. Colours from dark to light indicate the stage at which the zone is added to the void. The darkest colour is the original zone, 
the next-darkest is the first zone or set of zones added, etc. The only zone that is never included is that with the highest-density link to another zone, in the 
lower-right corner. A measure of the probability that each zone-adding event leads to a void that did not arise from Poisson noise is shown in Fig. [4] 

hole by eye, as well. ZOBOV operates under an implicit assumption 2.4 Statistical significance of voids 
that the discreteness noise is similar to that in a Poisson density- 
sampling, and ZOBOV could give surprising results if particles are The probability that a void v is real is judged according to its den- 
carefully arranged to fool it. sity contrast, i.e. the ratio r (v) of pi (v), the minimum-density par- 
ticle on a ridge beyond which is a deeper void, to v's minimum 
density, p m i n . This is not the only conceivable way to judge the 
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256' particles, Poisson 

128 3 particles, Poisson 
AAVFCP Millennium subbox 
Poisson fit 




J 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.1 

Density contrast r 



Figure 2. The cumulative probability function P(r) of the ratio r(v) be- 
tween the lowest density of a zone and the density at which water would leak 
into an adjacent zone that is deeper. The solid and dashed curves show P(r) 
for uniform-density Poisson processes using 256 3 and 128 3 particles. The 
curve has the same shape as the number of particles increases. The dotted 
curve shows the fit in Eq. {T}. The dot-dashed curve shows P(r) for voids 
in the region analysed in the Aspen-Amsterdam Void-Finder Comparison 
Project (AAVFCP). 



significance of a void. But it is simple, and the probabilities it re- 
turns roughly align with what visual inspection would suggest. 

The density contrast r is converted to a probability by compar- 
ing to a Poisson particle distribution. Several statistical properties 
of Voronoi diagrams applied to Poisson-sampled uniform density 
distributions are well-understood. For example, the distribution of 
Voron oi cell volu mes is well-approximated by a gamma distribu- 
tion dKiandll966l) . and the average number of Vorono i neighbours 
(48tt 2 /35 + 2 « 15.54), is even known analytically dOkabe et al.l 
2000). Unfortunately, the distribution of contrasts of density de- 
pressions in a Poisson Voronoi diagram is not known. It seems 
difficult to model analytically or from known results, since each 
depression has an unknown number of particles, whose estimated 
densities depend on each other in a complicated way. Therefore, 
for ZOBOV, this distribution is measured in a Monte-Carlo fashion 
from Poisson sampling. 

Let the cumulative probability P(r) be the fraction of voids 
in a Poisson particle distribution with density contrast greater than 
r. Figure [2] shows P{r) as a function of r for two cubic Poisson 
simulations (assuming periodic boundary conditions), with 128 3 
and 256 3 particles. It also shows the following fit to P(r): 

P(r) = exp[-5.12(r - 1) - 0.8(r - l) 2 ' 8 ]. (1) 

This P(r) gives an estimate of the likelihood that a void with den- 
sity contrast r could arise from Poisson noise, i.e. that it is fake. 
Table Q] shows density contrasts corresponding to the first seven 
'sigmas' (with analogy to a Gaussian distribution), calculated us- 
ing this fit. The fit may be trusted to roughly r = 3, beyond which 
there is no Poisson data. In NGH, we found that the analogous sig- 
nificance measure for haloes seems to lose its meaning at some 
point between the 4- and 1-a level anyway. That is, 7-er haloes are 
not visibly more robust than 4-<r ones. 

Figure|2]also shows the cumulative distribution of the density 
contrast r for a region of Millennium simulation, discussed in Sec- 
tion(3j results for this region are listed in TableQ] as well. Possibly, 
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Table 1. Void abundances for various density contrasts r in a Poisson par- 
ticle simulation, and in the AAVFCP region. With analogy to a Gaussian 
distribution, the first two columns list the levels of probability correspond- 
ing to different cr's. The third column (r) gives the density contrast with 
abundance P(r) in a Poisson simulation, calculated using Eq.fT] The fourth 
column gives the number of voids exceeding density contrast r in the 40- 
h~ 1 Mpc AAVFCP region. The last column adds the constraint that the 
minimum density of the void p m i n < 0.2, in units of the mean density. The 
last two columns are illustrated in the top panel of Fig. [8] 



a natural place to stop accepting voids as real in this data set would 
be where r goes below the curves' intersection, at r ~ 1.5. How- 
ever, this occurs at about a 2-a level, which seems quite low. 

In the 256 3 -particle Poisson simulation, ZOBOV detected 
335025 voids; thus, the average number of particles in a zone is 
50.1. This also means that one out of every 50.1 particles is a den- 
sity minimum. The average number of particles in a void is greater 
than 50.1, though, since a void may be comprised of many zones. 

The 2D version of Eq. (TJ is 



P(r) = cxp[-2.6(r - 1) 



(2) 



This fit is based on a rather small set of 256 uniformly Poisson- 
distributed particles. It only extends to P(r) ~ 10~ 3 (about 3 a). 
This fit gives r — 1.44, 2.19, 3.3, 4.7, 7, 9, and 11 for significance 
levels of l-7cr. 

In addition to the statistical probability criterion, there is a 
simple physical criterion to use. The natural dark-matter density 
associated with a top-hat void that has undergone spherical expan- 
sion is Pvoid ~ 0.2 (hereafter, densities are assumed to be in units 
of the mean density) at redshift 0. Because the densities of galaxies 
and dark matter differ in general, this criterion may be inappropri- 
ate to use for galaxies. This could be incorporated into the signifi- 
cance measure, for example by multiplying the probability the void 
is fake statistically by the probability of getting its core-particle 
(minimum) density p m in in a Poisson Voronoi diagram with den- 
sity 0.2. However, in the AAVFCP sample, all ZOBOV voids statis- 
tically significant at the > 3<r level have core densities p m in < 0.2 
anyway (see Fig. [8}. Also, the population with p m i n < 0.2 is quite 
distinct; there are few voids close to p m in = 0.2. So, a simple cut- 
off at p m in = 0.2 may suffice as a physical criterion, redundant if 
only large-significance voids are used. 



2.5 Defining the edges of voids 

As noted above, the deepest ZOBOV void in a set of particles will 
encompass all zones except the one with the highest-density ridge 
separating it from other zones. There are (at least) three ways to 
deal with this situation. 

The first option is to do nothing further. The raw ZOBOV re- 
sults would then consist of a large void, with several subvoids (and 
sub-subvoids, etc.) of varying significance levels. A zone can be- 
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long to multiple voids and subvoids. This option could be appeal- 
ing in its simplicity, and is well-suited to the physical hierarchy that 
voids are thought to have in the Universe (jDu binski et aL 1 19931 : 
ISheth & van de Weygaertl |2004 iFurlanetto & Piranl 12006^ How- 
ever, the following two options likely produce more practically us- 
able sets of voids. 



2.5.1 Specifying a significance level 

The second option is to excise subvoids exceeding a particular sig- 
nificance level (e.g., 5-a) from parent voids. If a subvoid is removed 
from a void, then all zones which join the parent void in the same 
accretion event as that subvoid, or in subsequent ones, are also re- 
moved. This option is a natural choice if disjoint voids are desired, 
which is traditionally the case. 

Figure[3^ shows the result of this procedure, applied to the set 
of particles in Figure[T] using a 2-a threshold of r = 2.19. Accord- 
ing to Eq.[2] voids 1 and 2 are significant at the 4-cr level, and voids 
3 and 4 are significant at the 3- and 2-a levels. The darkest regions 
belong to no void over 2a. 



2.5.2 Determining the most probable extent of voids 

The third option is to use the density contrasts of voids and sub- 
voids to define a most-probable extent of voids. Suppose a zone 
z has a sequence of extents, v%. For example, Fig.[TJi shows the 
various possible extents for void 1 in that particle set. 

At each zone-adding event, define a significance Si . The sig- 
nificance of zero zone additions So = P[r(z)], the probability of 
zone z's density contrast r(z) arising in a Poisson particle distri- 
bution. Note that the density ratios r(z) = pi(z)/p m i u (z) and 
r(v) — pi(v) / pmi n (z) can differ. pi(v) is the lowest density 
among ridge particles linking z to a deeper zone (perhaps with a 
path through other zones), while pi (z) is the lowest density among 
ridge particles linking z to any of its neighbouring zones. 

Call the void after the ith zone-adding event w, (vo = z), and 
call the set of zones to be added in the (i + l)st zone-adding event 
Zi+i — {zi+i,j}. To judge the wisdom of the [i + l)st addition, 
compare the probability that Vi and all of the zones in Zi+i are 
individually fake to the probability that their union, Vi+i = Vi + 
Zi+i, is fake. Given Si, define Si+i as 



= Si 



P[r(vi)] Ui PW*»U 



(3) 



Here, r(vi) — pi(vi)/p m i n (z), where pi(vi) is the the lowest den- 
sity among particles on the ridge separating w, from the set of 
prospective new zones Zi+i. If the new zone set Zi consists of 
other entire voids (i.e. sets of zones) with subvoids, only the voids, 
and not the subvoids, enter the product in the denominator. 

For example, zone 3 in Figure [TJ:, and its two neighbour- 
ing zones below and to the left of it (call them 3' and 3"), 
are separated by an insignificant (below 1-a) density ridge, un- 
detectable by eye in the original particle set. The probability 
that all three are fake (separately arose from Poisson noise) is 
P[r(3)]P[r(3')]P[r(3")] = P(1.28)P(1.11)P(1.06) = 0.30, 
using Eq. l|2}. The probability that their union is fake is P[r(3 + 
3'+3")] = P(3.57) = 0.0013. Since the latter is rarer, the union is 
favoured statistically. In the Si notation, these probabilities are nor- 
malized differently; to get the expressions in this paragraph, multi- 
ply through by the probabilities in the denominator of Eq. l[3}. 




Figure 3. Two strategies for detecting the edges of highly significant voids, 
as applied to the particle set in Fig. [T] In (a), discussed in Section 12.5.11 
the user chooses a significance level at which to accept a void (here, 2-cr). 
Voids exceeding this threshold stop growing when they encounter another 
void exceeding this threshold. Particles in the darkest regions belong to no 
void over 2 a. In (b), discussed in Section 12.5.21 a most-probable extent 
is found for each void. Zones are coloured according to their significance. 
Zones in the deepest, 4-cr void are lightest; zones included in 3-, 2-, and 
1-<t voids are coloured increasingly darkly. Particles in the darkest region 
belong to no void over 1 a. The hatched regions are 1-<t subvoids. 



Figure |4] shows the significances of the various possible ex- 
tents for the three large voids in Fig. [7] It shows the first dip in void 
3's likelihood of fakeness when the first group of zones is added, 
discussed in the previous paragraph. The second (and last) prospec- 
tive addition, the last one before a deeper zone is encountered, gives 
an upturn in void 3's curve. Thus, the last addition is not favoured 
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Figure 4. Significances of various extents of the three voids in Fig.[T]that 
encompass more than their central zone. The extents at the minima of these 
curves are shown in Fig. [5J>, with the exception of void 1. For void 1, the 
extent shown in Fig.[5p has a number of additions i = 3, the minimum of 
the curve excluding the high-i ramp into high-density regions. 



statistically. However, using the method of Section l2.5.1l this extra 
zone is included in zone 3. 

Void 1, the deepest void, has the longest curve in Fig. [4] Its 
probability of fakeness reaches a local minimum after the third ad- 
dition of zones. In accord with intuition, the curve then increases 
again, but then as the densest zones in the figure eventually get 
included, the density contrast grows sharply, making the curve 
plunge. ZOBOV is detecting everything except the dense points in 
the lower-right corner as a highly significant void. To prevent voids 
from growing into haloes, a density limit for links between zones 
may be set. For dark matter, a natural value for this limit would be 
Pi, max = 0.2. Alternatively, one might simply accept the lowest 
minimum before the ramp downward at the end. 

Figure [3p shows most-probable ZOBOV void extents for the 
2D particle set. Zones are coloured according to their significance 
level. Zones in the deepest, 4-a void (defined using three zone- 
addition events) are lightest; zones included in 3-, 2-, and 1-ct voids 
are coloured increasingly darkly. Particles in the darkest region be- 
long to no void over 1 a. The hatched zones are l-cr subvoids within 
larger voids. All zones are actually subvoids, but most of them do 
not pass the 1-<t level. 

2.6 Selection functions, boundaries, and holes 

Observational effects complicate the application of ZOBOV to real 
data, e.g. from a galaxy redshift survey. This section contains some 
speculations about how to deal with these effects. 

ZOBOV can naturally accommodate a selection function 
that varies with position, 4>(x). All one needs to do is to 
divide the density of each pa rticle/galaxy at Xi by cj)(xi) 
( Ivan de Weygaert & Sc haap 2008). For void finding, densities es- 
timated with the DTFE (defined between particles) may in general 
be preferable to what ZOBOV uses, the VTFE (defined at particles). 
However, a variable selection function is more natural to correct for 
using the VTFE. 

ZOBOV is designed for a periodic simulation, but other bound- 
ary situations can be handled. If an isolated set of particles is anal- 
ysed without any modifications, ZOBOV will still correctly deter- 



Figure 5. A Mpc-wide slice through the the inner 40- 

h~ 1 Mpc cube analysed for the AAVFCP. This and the following fig- 
ure were produced using Nick Gnedin's IFRIT software, available at 

|http : //home ■ fnal ■ gov/~gnedin/IFRIT/| 



mine the adjacencies of each particle. However, particles on the 
edge could have arbitrarily large Voronoi volumes, and thus many 
spurious density minima will occur on the edges. A trivial way of 
preventing this is to set all edge particles' densities to a value higher 
than any density in the interior. Another way is to add a buffer zone 
of particles at (for example) the mean density around the dataset. 
This will inhibit edge effects for densities estimated for particles a 
bit below the surface, as well. However, there is some ambiguity in 
how to make this buffer. 

Holes and significantly non-convex boundaries in the data 
are perhaps the most difficult problem for ZOBOV. A sim- 
ple way of dealing with the problem might to put parti- 
cles in the holes, Poisson-sampling at the mean density in- 
side them. The density could also be interpolated among neigh- 
bouring particles, perhaps an iterative process. Or, perhaps op- 
timally, it could be estimated through a constrained realization 
llBertschingedll987l; iHoffman & Ribak 199ll; IZaroubi et alJI 19951: 
Ivan de Weygaert & Bertschinged 1996). With any sort of Poisson 
hole-filling, it would be wise to try several realizations. 



3 RESULTS 

Here I discuss the application of ZOBOV to dark-matter particles 
from a cube 60-h^ 1 Mpc on a side (hereafter, the 'full cube'), 
taken from the Millennium simulation. Aspects of these results in 
the inner 40-/i _1 Mpc cube (hereafter, the 'inner cube') can be 
found, with direct comparisons to other void finders, in the Aspen- 
Amsterdam Void-Finder Comparison Project (AAVFCP) paper. To 
reduce the dependence on boundary conditions, only voids with 
cores (minimum-density particles) in the inner cube were analysed 
in the AAVFCP. 

ZOBOV is designed for a periodic simulation. For the non- 
periodic AAVFCP cube, a square lattice of particles at the mean 
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Figure 7. Significances of possible void extents for the void shown in Fig. 
[6] The most-probable extent (a minimum on the curve) is at i = 2 using the 
original fit to the void probability function, Eq. (TJ. To achieve a minimum 
at i = 200 (the extent shown in the bottom panel of Fig. [6), a steeper fit 
such as Eq. (4) must be used. The simulations used for Fig.|2]are too small 
to probe the abundance in Poisson simulations of ZOBOV voids with as high 
density contrast as this void. 




Figure 6. The largest, and most significant, void VOBOZ found the inner 
40- h~ 1 Mpc AAVFCP cube. Green, diffuse particles are in the void; black 
particles are not. The large red dot is the core (minimum-density) particle 
in the void. The top panel shows the full void. The bottom panel shows the 
void as truncated as described in Section l2.5.1l using a significance level of 
5-<t; it is the same as the most-probable extent of the void as described in 
Section l2.5.2l if the 'steeper fit' in Fig. [7] Eq. @, is used. 



density was added to each face of the full cube, quite far away from 
the inner cube (where the results are actually analysed). 

ZOBOV detected a couple of orders of magnitude more voids 
in the AAVFCP region than almost any other void finder. This is 
because of the many low-significance voids, and subvoids, it de- 
tected; see TableQ] and Figure[8] The number of 5-a voids (29) is 
typical of the number of voids detected by other void finders. 

Figures[5]and[6]show the largest, and most significant, ZOBOV 



void that has a core particle in the inner AAVFCP cube. There was 
actually a deeper void in the full cube, but its minimum-density 
particle was on the outer edge, perhaps an artefact of the bound- 
ary conditions used for the tessellation. It is this void, not the void 
shown in the figures, that encompasses nearly the whole volume, 
as the most-significant ZOBOV void typically does. 

The bottom panel of Fig. [6] shows the void in the top panel, 
truncated as in Section |2.5.1| with a 5-a probability threshold. It is 
also the most-probable extent of the void as described in Section 
12.5.21 with a caveat. Figure UJ shows the extent-significance curve 
for this void, using two different fits of density-contrast vs. proba- 
bility. The solid curve uses the original fit, Eq. |QJ, for P(r) in Eq. 
l[3}. The minimum of this curve (showing the most-likely extent) is 
at i = 2, giving a tiny region around the central zone. There are two 
explanations for this discrepancy between what ZOBOV and the hu- 
man eye pick out: density contrast alone is an inadequate quantifier 
of void significance (as judged by the human eye); or, the fit to the 
probability of void fakeness in Eq. Q} is inaccurate at high r. 

Even using Eq. |QJ, there is a sharp increase in the curve at 
i — 200; the void extent at this point is shown in Fig. [6] For this 
to be returned as the most-probable extent, the probability of void 
fakeness must be dramatically reduced by a factor of e 351 at r = 
4.5 (the density contrast reached at i — 200). A steeper fit that 
achieves this is 



P(r) = exp[-5.12(r - 1) - 0.8(r - l) 4 ' 7 ], 



(4) 



the fit used for the dashed line. Unfortunately, the Poisson simu- 
lation used for Fig. [2] is not large enough to test the probability of 
such rare, high density contrasts. 



3.1 Lagrangian density distribution 

The top panel of Figure[8]is a scatter plot of density contrast r ver- 
sus the core (minimum) density p m i n , for voids in the AAVFCP 
region. There are two clusters of points: one for p m in < 0.2, about 
the natural density of a void in ACDM; and a second at high den- 
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Figure 8. Top. For ZOBOV voids in the 40-Zi" 1 Mpc AAVFCP region, a 
scatter plot of the minimum density p m im and the density contrast r (the 
ratio of pi , the density at which a void would merge with a deeper void, and 
Pmin)- This plot shows two populations, one that satisfies the 'physical' 
significance criterion, p m j n /p < 0.2, and another that does not. The high- 
density population contains only one void above 3o\ whereas 10 would be 
expected from 3765 Poisson voids. 

Bottom. Probability density functions (PDF's) of particle densities in the 
AAVFCP region. The dashed black curve shows the PDF from particles in 
the full 60-/i~~ 1 Mpc box; the dotted blue curve uses only particles in the 
inner 40-Zi" 1 Mpc box. The inner box has fewer haloes per unit volume; 
this explains the lower high-density peak, and higher low-density peak, in 
the inner box. The solid green curve shows the PDF of void minimum- 
density particles. Peaks in the PDF using the full particle set (e.g. the dashed 
blue curve) seem to give sharper peaks in the PDF of void core-particle 
densities (the green curve) at slightly smaller densities. 



sity, p m in ~ 10 2 ' 5 . At first, the high-density group may be sur- 
prising, but all of these voids have low density contrasts. Only one 
of them barely passes the 3-a level, even fewer than the expected 
number (10) of 3-a objects in a sample of 3765; this is the number 
of voids with p m in > 0.2. All highly significant voids above ~ 3-<r 
in density contrast are also physically significant, with p m in < 0.2. 
This lends credence to both significance measures. 

The high-density cluster in the top panel of Fig. [8] appears to 
be related to the high-density peak at p ~ 10 2 ' 5 in the probability 
distribution P(p) of particle densities, shown in the bottom panel. 
This peak is at approximately the fiducial density of virialization, 
Pvir ~ 200, so the particles in this peak typically reside in col- 
lapsed structures. The high-density peak in P(p) is smaller in the 
inner cube than in the full cube, and vice-versa for the low-density 
peak at p ~ 1. This makes sense, since haloes are scarce in the 
inner cube, most of which is occupied by a large void. 

This P(p) is approximately a Lagrangian version of 
the Eulerian counts in cells (C IC) statistic Pcic {N, V) (e.g. 
ISzapudi. Meiksin & Nichol Il996l) , which measures the distribu- 
tion of the numbers of particles N in fixed grid cells of volume 
V. Roughly, P(p) oc pPcic(N = pV,V), since, for exam- 



ple, each cell containing three particles will be counted once for 
Pcic(N = 3, V), but thrice in P(p = N/V). CIC measurements 
do not have a high-density peak, but they often have a significant 
high-density tail that, when multiplied by a factor of p (or TV), may 
produce a peak. It would be interesting, but beyond the scope of this 
paper, to model this high-density peak in P(p) using, for example, 
the halo model. 



4 DISCUSSION 

ZOBOV has a few unique, appealing features, that I believe are 
worth keeping in mind as cosmologists develop a standard defi- 
nition of voids. These features are: 

• Parameter-independence. The set of voids ZOBOV returns for 
a set of particles depends on no parameters, based on a simple defi- 
nition of a void: a depression around a density minimum. However, 
the word 'depression' is also a bit vague; its definition for ZOBOV 
is essentially the first few paragraphs of Section [2] These imple- 
mentation choices are, in a sense, parameters. 

• Statistical-significance measurement for voids. Void finding is 
not a clear-cut business, so ZOBOV does not return a clear-cut set 
of results. Instead, it measures a probability that each void is real, 
based on how likely the void's density contrast occurs in a Poisson 
realization. Thus approach, I believe, is philosophically satisfying, 
but the raw results it returns are not necessarily straightforward to 
analyse. To get a definitive set of disjoint voids, one can set a sig- 
nificance level at which to trust that a void is real. Alternatively, 
ZOBOV has a mechanism to determine the most-probable extents 
of voids. There could be ways of analysing the raw, parameter- 
free ZOBOV results, as well. For example, a void probability func- 
tion measurement could include all voids, but weight them by their 
probability of being real. ZOBOV is not the only void-finder that 
emplo ys a statistical-significance test (e.g. iKauffmann & Fairalll 
Il99ll) . 

• Hierarchical voids. Just as haloes contain subhaloes, voids 
contain subvoids. ZOBOV naturally accommodates this fact, detect- 
ing subvoids as well as voids. Again, a hierarchy of voids does not 
lend itself to straightforward analysis using traditional methods, but 
methods could be devised which take advantage of this hierarchical 
information. 

There are also some areas that could benefit from further study 
or improvement: 

• Using the (dual) Delaunay instead of the Voronoi tessel- 
lation for density estimation. The DTFE and VTFE (based on 
these two tessellations, respec tively) give natural density esti- 
mates from a set of particles I Schaap & van de Wevgaer J l20QOl: 
IPelupessv. Schaap & van de Weyg aert 20031; ISchaapl 120071) . WVF 
jPlaten et alj 120071) . for instance, uses the DTFE instead of the 
VTFE. Both the DTFE and VTFE have no free parameters, and have 
infinite spatial resolution, up to machine precision. Arguably, they 
give the most local possible density estimates with meaningful in- 
formation. The VTFE defines densities at each particle, and thus is 
natural for finding maxima in a set of particles. This is why we 
used it for the halo finder VOBOZ. However, the DTFE is a more 
natural choice for finding minima, since it defines densities in cells 
between particles. For a well-sampled density field as in an A^-body 
simulation, the differences are likely negligible, but for sparse (e.g. 
galaxy) particle samples, ZOBOV should ideally use the DTFE. On 
the other hand, the VTFE could be preferred when faced with a vari- 
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able selection function, which it handles more naturally than the 
DTFE does. 

• The definition of statistical significance for a void. ZOBOV 
judges the statistical significance of a void v by the contrast be- 
tween the lowest density on a ridge beyond which is a deeper void, 
and v's minimum density. This definition is simple and easy to cal- 
culate, and the probabilities it returns compare favourably to what 
visual inspection suggests. 

However, there are other possible significance measures. For ex- 
ample, the algorithm could be run several times on different Monte- 
Carlo realizations of the density field, formed by moving ('jitter- 
ing') all particles around in some fashion corresponding to the 
noise in the system. If the limiting noise is from particle discrete- 
ness (which is not usually the case for actual data), a natural way 
to jitter the particles would be to move each particle to a random 
place in its initial Voronoi cell. For TV-body simulations, a jitter ac- 
cording to a measure of the spatial resolution (e.g. the gravitational 
softening length) might be more appropriate. For 3D galaxy red- 
shift surveys, the main uncertainties are probably the distances in- 
ferred from redshifts, and the handling of boundary conditions and 
holes. For a large data set like an TV-body simulation, this Monte- 
Carlo approach would probably take prohibitively long, but for a 
more manageable dataset like a galaxy catalogue, estimating sig- 
nificances in this way might be tractable. 

Another finding that emerged from this study is a broad, high- 
density peak in the logarithmically binned probability distribution 
of dark-matter particle densities, at p « 10 p, as shown in Fig. [8] 
It would be interesting to see whether this feature can be modelled 
successfully using the halo model of large-scale structure. 

The code for ZOBOV, packaged with the 
halo-finding algorithm VOBOZ, is available at 
|http : //if a . hawaii . edu/~neyrinck/voboz| The 
new edge-detection methods developed for ZOBOV, described 
in Section |231 make VOBOZ more attractive than previously for 
finding clusters in general point sets, such as galaxy catalogues. 
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